Modular neural networks for low-complex phoneme recognition
نویسنده
چکیده
We present a Modular Neural Network (MNN) for phoneme recognition within the framework of a hybrid system (neural networks and HMMs) for speakerindependent single word recognition. With this approach, we are taking the computational effort into account which is used as an additional criterion for assessing the system performance. The main idea of the proposed MNN is the distribution of the complexity for the phoneme classification task on a set of modules. Each of these modules is a single neural network which is characterized by its high degree of specialization. The number of interfaces, and therewith the possibilities for infiltering external acoustic-phonetic knowledge, increases for a modular architecture. Moreover, after the development of a suitable topology for the MNN, each of the modules can be optimized for its specific phoneme recognition task. This is done by detecting and pruning irrelevant input parameters and leads to a more efficient system in terms of memory and computational requirements.
منابع مشابه
Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملمعرفی شبکه های عصبی پیمانه ای عمیق با ساختار فضایی-زمانی دوگانه جهت بهبود بازشناسی گفتار پیوسته فارسی
In this article, growable deep modular neural networks for continuous speech recognition are introduced. These networks can be grown to implement the spatio-temporal information of the frame sequences at their input layer as well as their labels at the output layer at the same time. The trained neural network with such double spatio-temporal association structure can learn the phonetic sequence...
متن کاملA novel fast learning algorithms for time-delay neural networks
To counter the drawbacks that Waibel 's time-delay neural networks (TDW) take up long training time in phoneme recognition, the paper puts forward several improved fast learning methods of 1PW. Merging unsupervised Oja's rule and the similar error back propagation algorithm for initial training of 1PhW weights can effectively increase convergence speed, at the same time error firnction almost m...
متن کاملGenetic Algorithms for the Design of Fuzzy Neural Networks
The paper presents a methodology for designing the structure of a fuzzy neural network in a multi-modular connectionist system for classification purposes and illustrates the methodology on the task of phoneme recognition of the 43 phonemes in New Zealand English. The results show that by using this methodology the recognition rate can be improved significantly when compared to the recognition ...
متن کاملNovel Objective Function for Improved Phoneme Recognition Using Time-delay Neural Networks. Vii. Conclusion and Future Work Iv. Phoneme and Viseme Coding
In this paper we show how recognition perfor-mance in automated speech perception can be significantlyimproved by additional Lipreading, so called “speech-read-ing”. We show this on an extension of an existing state-of-the-art speech recognition system, a modular MS-TDNN. Theacoustic and visual speech data is preclassified in two sepa-rate front-end phoneme TDNNs and com...
متن کامل